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Abstract A basic belief assignment can have up to 2 ra focal elements, and 
combining them with a simple conjunctive operator will need 0(2 2n ) opera- 
tions. This article proposes some techniques to limit the size of the focal sets 
of the bbas to be combined while preserving a large part of the information 
they carry. 

The first section revisits some well-known definitions with an algorithmic 
point of vue. The second section proposes a matrix way of building the least 
committed isopignistic, and extends it to some other bodies of evidence. The 
third section adapts the fc-means algorithm for an unsupervized clustering of 
the focal elements of a given bba. 
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1 General considerations on basic belief assignments 

Let the finite set X = {x%, . . . , x n } be our frame of discernment. The size of 
X will be noted n = \X\. The set of all the subsets of X will be noted 2 X . 

Definition 1. |Shafer(1976)| The application m from 2 X to [0,1] is a basic 
belief assignment (bba) if : 

E m ( A ) = 1 (!) 

ACX 

The constraint of closed world is modeled by m(0) = 0. If rn(0) is greater 
than 0, we either have an open world or a conflict within the information. 
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Definition 2. Let m be a bba on X. A C X is a /oca/ element of m if 
m(A) > 0. The /oca/ set of m is composed of all its focal elements : 

F(m) = {ACX\ m(A) > 0} (2) 

The size of m is noted \m\ = Card(-F(m)). 

Of course, \m\ ^ 2". In most applications, \m\ will be very small compared 
to 2™ when a bba is constructed from a source's information, but after some 
steps of combination, this limit can be reached. 

Definition 3. Let m be a bba on X . The most usual bodies of evidence are : 

• The belief: ^-^ ^-^ 

bel(A) = m ( B ) = E ™( B ) ( 3 ) 

BCA, BCA, 
BeF(m) 

• The plausibility: 

pl(A)= m ^= E m ^ w 

BeF(m) 

• The commonality: 

q(A) = ]T = £ m(B) (5) 



BDA 



BDA, 
BeF(m) 



• The pignisitic probability, which is additive (knowing betP({cc}) for all 
x 6 X is sufficient): 

betpM) = l — Y \Af^l m (B) = 1 y \All_ B i m{ B) (6) 

When the context is not obvious, the bba used to define the body of evidence 
will be placed as an index : betP m (A) instead of betP(m). 

In the definition [3j the first expression concerns all the subsets of X, and 
the second expression concerns only the focal elements. Therefore, if / is 
either of the bodies of evidence, and A a subset of X, a natural implemen- 
tation of the equation brings an algorithm which calculates f(A) in 0(2 n ) 
operations with the first expression. As the second expression only browses 
the focal set of m, its complexity is C(|m|), for the same result. 

The most popular combination operator is the non-normalized conjunctive 
rule, also known as Smet's rule. It is a quite simple operator to implement; 
it is associative, and therefore allows to combine many sources. 
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Definition 4. Let mj and 777.2 be two bbas on X . The conjunctive combina- 
tion of 777,! and m 2 is a bba on X , mi © 777 2 , denned by : 

( mi (Bm 2 )(A)= Yl m 1 (B)m 2 {C)= ^ 777i(B)m 2 (C) (7) 

BCX, BEF(mi), 
CCX, CeF{m 2 ), 
BnC=A BnC=A 

The cost for calculating B n C is 0(77). The first expression brings an al- 
gorithm in O (t72 2 ™) operations for calculating (mi © 7772) (A), and O (t72 3 ™) 
for determining mi © m 2 . The second expression brings an algorithm in 
0(n|mi||m2|) operations for calculating (mi © m 2 )(A) = (7711 © 7772) (^4), 
and O (77,2™|mi||m2|) for determining mi © 777,2. 

Smets [Smets(2002)| proposed a nice implementation in 0(n2 n ) operations 
for transformations between bba and commonality. The conjunctive combina- 
tion of the commonality functions is a simple multiplication, which is linear, 
but on vectors having a size of 2™. 

The expression ([7]), nor the commonality, can prevent us from making 
operations on non- focal elements of mi ©7772. Let the bba be implemented by 
an adaptive structure that contains information only for its focal elements. A 
hashtable is a convenient way for it. The algorithm [1] uses only C(77|mi | |m2|) 
to build mi © 7772. 

The size of mn is at most |mi | |m2 1 . The algorithm coming from ([7]) needs to 
be executed for all the subsets of X, but the algorithm[T]only works on the fo- 
cal elements of m n , and does not compute useless intersections |Smets(1994)] . 
Using a hashtable for the focal elements, with a hashcode calculation in 0{n) 
operations, the conjunctive combination takes 0(?7,|mi | |m2|) operations. 

Data: bbas mi, 777,2 
Result: bba m n 
forall B E ml do 
forall C G 7772 do 

if B n C <E m n then 

m n (B nC)MTi n (BnC) + m 1 (B)m 2 {C) 
else 

Add B n C to m n 
m n (BnC) <- mi( J B)m 2 (C*) 

Algorithm 1: Conjunctive combination 

However, the very nature of the combination operator brings a combina- 
torial explosion of the focal set. Let m t be the bba defined by mi (X) = ^ 
and mi(X\{xi}) = ^: \mi\ = 2. Let m n be the conjunctive combination of all 
those bbas : m n = mi © ... © m„. For any ACI, m n (A) — Therefore, 
F(m n ) = 2 X and |m n | = 2™. 

The objective of the following sections will be to guarantee that the size 
of a bba cannot be too large, and to respect its nature as much as possible. 



4 



Christophe OSSWALD 



2 Linear algebra for bbas 

The definition [3] builds the bodies of evidence bel, pi, betP and q as linear 
transformations of m. Considering a bba m on X and an integer K, our 
objective will be to build an bba m! on X such that \m'\ ^ K and f m i(A) = 
fm(A) for some bodies of evidence / and some subsets A of X. 

Within this section, we forbid to be a focal element of m, and we do 
not allow it to become a focal element of m! . As convenient consequences, we 
have bel(A) ^ betP(A) «C pl(A), bel(X) = 1, and pl(X) = 1. 

A popular and efficient way to build a bba from a probability or another 
source of uncertain information is to build a least committed bba having the 
same pignistic probability than the source [Smets(1990j] . 

Definition 5. Let m be a bba on X. A bba m' is an isopignisitic of m if 



The bba m' is the least committed isopignistic of m if for any isopignistic m" 
of m and for any A C X, p\ rn ,(A) ^ p\ m „(A). 

The algorithm [2] builds the least committed isopignistic in 0(n 2 + n\m\) 
operations. It contains at most n focal elements. 



Data: bba m on X 
Result: bba m! on X 
forall i£ldo 

Calculate p[i] = betP(ar) 
A <— X; k \X\ 
while max(p) ^ do 

i <— argmin(p) 

m'(A) <- kp[i] 

forall j £ p do 

p[j] ^P[j] ~p[i] 
Delete element i from p 
A <- A\{xi}; 

Algorithm 2: Building the least committed isopignistic 

If we calculate betP(x) for all x € X, and order the elements of X such 
that pi = betP(iEi) ^ betP(x,+i) = Pi+i, the focal elements of the least 
committed isopignistic are a subset of the Ai — {x\, . . . , x{\. 



Vx 6 X, betP m (x) = betP m '(x) 



(8) 



We have 




(9) 



Let p be the vector of the pi and y be the vector of the m'(Ai). We 
have p = Bety with Bet a n x n matrix, triangular and inversible. Therefore 
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Bet p, with 
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from 



As Bet -1 is a triangular band matrix, we can compute all the m'(Ai 
Pi in 0(n) operations. 

With 0(n|rn|) operations for computing betP, O(nlnn) operations for 
sorting A, 0(n) operations for building the sets Ai (with an adapted 
data structure) and 0(n) operations for solving the linear system, building 
the least committed isopignistic costs 0(n(lnn + \m\)) operations. Usually, 
| rn | » Inn, and the cost of the least committed isopignistic is not greater 
than the cost of computing betP(x) for the elements of X. 



The interval [bel(A), pl(A)], containing betP(A), can be interpretated as 
an uncertainty on A |Janez and Appriou(1996)] . For singletons, bel is trivial: 
bel(x) = m(x). For sets of size n — 1, pi is trivial: pl(X\{x}) = 1 — m({x}). 
Considering the non-trivial bodies of evidence on the sets of interest {xi}, 
. . . , {x n }, B\ = X\{xi}, . . . , B n = X\{x n }, we search a bba to' with those 
focal elements, forming a vector 

y = (m'({ Xl }), . . . , m'({x n }), m'{B x ), . . . , m'{B n )) T (11) 
which verifies: 

We U, «J, p\ m ,{{x i }) = p\ m {{x i }) (12) 

Vi e |l,nj, beW(Si) = bel m (Si) (13) 

We have: 

pl m ,({ Xi }) = m'{{x i })+Y,m'{B j ) (14) 

beW(B 4 ) = ^m l {{x j })+m'{B i ) (15) 

As Vi, p\ rn ,({xi}) +be\ m ,(Bi) = J^i m'({xi}) + Y,i m '( B j)> tnere are onl y 
n + 1 independent equations among the 2n listed above: we cannot guarantee 
to kep at the same time pl m ({xi}) and bel m (Bi) on those 2n focal elements. 

As q(-Bi) = m(Bi) + m(X) and q({xi}) = pl({a;i}), introducing common- 
ality does not bring any new independent equation. 
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2.1 Mixing Bet with other bodies of evidence 



Here we search a bba with 2n focal elements which is an isopignistic of m and 
respects an other body of evidence on some focal elements. In the following 
examples, we allow the Ai obtained in section [2] to be focal elements, and we 
complete them with {{xi})i^u in \ or the (-Bj)j e ii n j. 



With plausibility, we should use the focal elements ({^i})ie|i, n J 
build a vector 



y = K({xi}), • • • , m'({x n }), m'{A x ), . . . , m'{A n )) 
The constraints are: 

" 1 

betP(x 4 ) = m'({ Xi }) + rm'(A k ) 



k—i 



pl({ Xi }) = m'({ Xi }) +Y,™'{A k ) 



We 
(16) 

(17) 
(18) 



As Ai — {xi}, we cannot have m'(Ai) ^ m'({xi}); we have only 2n — l focal 
elements. We drop the term m'({xi}) in y, and the constraint on pl({a;i}) to 
obtain a matrix P such that 



Py = (pUW), • • ■ , pl m ({z 2 }), betP(xi) 
The matrix P4 and more generally P n are: 



fo 1 i f i\ 

1 \ I ; 
1 I ; 



Pi 



1 \ 
10 111 
10 11 
\0 1 I J 




(19) 



(20) 



where Bet„ is matrix obtained in the section[2]and C/„_i the upper triangular 
(n— 1) x (n— 1) matrix full of 1. 

The matrix P n is inversible, and we can solve this system in 0(n 3 ) oper- 
ations. Overall, we can reduce the focal set of m to 2n — 1 focal elements in 
0(n(n 2 + \m\)) operations, respecting betP and pi on the singletons. 

With commonality, we obtain the same results : q({xi}) — p\({xi}). 

With belief, we should use (i?^) ie [i,ri.j as focal elements instead of ({xi}). 
As be\(E>i) +pl({£i}) = 1, we obtain another - but similar - (2n— l)x(2n— 1) 
inversible matrix. 
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3 Optimatization by fc-means 



Denoeux and Yaghlane(2002)| proposed to reduce a bba by adapting the sin- 



gle linkage hierarchical clustering algorithm to coarsen its focal set. Another 
interesting family of unsupervized clustering algorithm are the fc-means tech- 
niques, born from the ISODATA method of Ball and Hall(1965)| . One can 



adapt this method to hnd a subset K. of 2 X limited in size: \K,\ ^ fc. 

Usual fc-means does not guarantee an optimal choice of centers: finding 
them is equivalent to the MiNiMUM-fc center, which is a NP-Complcte prob- 
lem Garey and Johnson(1979) . The convergence of the fc-means algorithm 



is guaranteed, but only to a local minimum of the intra-cluster variance. 



Data: bba to, integer fc with fc ^ |m| 
Result: bba rrifc 

Let C[l], . . . , C[fc] be fc focal elements of to [1] 
repeat 

forall j fc do C[j] 

forall A S to do [2] 

C[argmin(dist(A, Cj))] 4- C[argmin(dist(/1, Cj))] U {A} 
forall j ^ fc do C[j] <- center of C[j}; [3] 
until ending condition reached [4] 
forall j ^ fc do 

m k (C[j}) <- T,AeC{j] m i A ) 



Algorithm 3: fc-mcans, in a general way that applies to focal elements. 

[1] It is natural to initialize the algorithm with the fc focal elements with 
the greatest masses. But, as the algorithm converges - if it converges - to 
a local minimum, it should be a good idea to execute various instances, 
with random starting sets. 

[2] The focal element A is affected to the center C[j] such that 



dmt(A,C[j})= (AnC\j]) u(AnC[j}) (21) 

is minimal. It corresponds to a natural Li distance based on an exclusive 
OR. In case of equal distances to different centers, it is possible to: 

• choose a random one ( the algorithm is no longer deterministic ) 

• use a lexicographical order (elements are no longer equivalent) 

• try to build balanced clusters (the underlying problem is NP-complete) 

[3] The usual fc-means technique uses the geometrical barycenter of the 
focal sets of C[j] seen as points of [0, 1]™ : C\j\ <— Y^AecU] Tn-CtyA. 
It would build fuzzy focal elements, which is not the way the definition [2] 
accepts them. Therefore, we put x in the new C[j] if and only if : 



E 



m(A) > 2^ m(A) (22) 



Aec[j},xeA Aec[j],x£A 
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[4] As we "move" the centers of the classes to the nearest sharp subset of 
X, the total intra-cluster variance is not necessarily decreasing. Therefore, 
the ending condition must include a maximum steps number, and/or test 
the cycles it should encounter. 

A step of the algorithm [3] costs 0(kn\m\) operations. A reasonable number 
of steps before ending the loop is k, and we obtain an algorithm in 0{k 2 n\m\) 
operations. If we want to compare this approach with the ones of the section 
12. 1[ we should use k = 2n— 1, and get an algorithm in C(n 3 |m|) operations. 



4 Conclusion 

In a general way, dealing with basic belief assignments on large frames of 
discernment need a proper encoding of the focal sets. We propose to use 
hashtables for this purpose, but this not the only way. We propose two cate- 
gories of methods for restricting any bba to a bba modest in focal set size. 

We extend the principle of isopignistic to other bodies of evidence to build 
a bba with only 2n— 1 focal elements, respecting both the pignistic probability 
and another body of evidence of the original bba. We first determine the value 
of the bodies of evidence on some simple elements, and then determine the 
restricted focal set. A linear equation gives the restricted bba. 

Trying to restrict the focal set to a number of respresentativc elements 
leads to a NP-Complete problem. We adapt the A;-mean algorithm to build 
a heuristical solution. It is more expensive, but it does not need to define a 
priori a focal set, and can adapt to more situations. 
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